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PROCESS FOR PRODUCING CHEMICALLY DIVERSIFIED 
PROTEINS, IN VIVO, BY INCORPORATING UNCONVENTIONAL 



allow prokaryotic or eukaryotic cells to acquire the 
capacity to produce proteins the amino acid sequences 
of which comprise at least one unconventional amino 
acid, to methods for selecting said cells, to methods 
for producing and for purifying said proteins, and also 
to the cells and proteins obtained using the methods 
and processes according to the invention. The invention 
also comprises the uses of said cells and proteins in 
various domains, such as the therapeutic, cosmetic or 
diagnostic domain or the domain of biosynthesis, or the 
biodegradation of organic compounds. 



mass-produced using recombinant organisms are employed 
as catalysts in the chemical industry or as therapeutic 
agents. The search for novel proteins with diversified 
functions is the subject of intense activity, either by 
screening the proteins of extremophilic organisms, or 
by creating protein variants by mutagenesis and 
screening. However, the chemical variability of the 
proteins which can be produced in living organisms 
remains limited by the invariance of the genetic code, 
i.e. restricted to combinations of a canonical set of 
2 0 amino acids. If descendence of natural species could 
be progressively remodeled in the laboratory so as to 
adopt various genetic codes, protein evolution could be 
redirected and artificial sources of biodiversity 
thereby established . 



the only path which would make it possible to overcome 
this limitation. An alternative genetic code might 
specify a smaller or larger set of amino acids, a set 
substituted with noncanonical monomers or a set of 
canonical amino acids among which the codons are 
redistributed. The specification of additional amino 



AMINO ACIDS 



The present invention relates to methods which 



An increasing number of proteins which are 



Experimental deviation of the genetic code is 



acids in living lines would lend itself to many uses, 
the most generic of which would be the establishment of 
artificial biodiversity. 

The permanent or transient incorporation of a 
single additional amino acid, carrying a chemical motif 
which could react without modifying the conventional 
amino acids, would be sufficient to establish novel 
protein f unctionalization methods. This is precisely 
the subject of the present invention. 

A subject of the invention is a method which 
allows cells to acquire the capacity to produce a 
protein the amino acid sequence of which comprises at 
least one unconventional amino acid, characterized in 
that it comprises the following steps: 

a) transforming said cells by at least one introduction 
of a missense mutation in a target codon of a gene 
encoding a protein required for the growth of said 
cells, said protein synthesized from the gene thus 
mutated no longer being functional; 

b) where appropriate, culturing the cells obtained in 
step a) in a culture medium containing the nutrient 
required by the loss of functionality of said 
protein thus mutated; and 

c) culturing the cells obtained in step a) or b) in a 
culture medium containing the amino acid encoded by 
said target codon. 

In the present description, the term "protein" 
will be intended to also refer to peptides or poly- 
peptides, and also to the corresponding glycoproteins 
when said proteins are glycosylated. 

In the present description, the term 
"unconventional amino acid" will be intended to also 
refer to any amino acid other than the amino acids 
incorporated by ribosomes during the biosynthesis of 
proteins synthesized by prokaryotic or eukaryotic, 
unicellular or multicellular organisms, and also to any 
amino acid incorporated in place of the amino acid 
which should normally be incorporated at this site with 
regard to the translated nucleic acid sequence. 
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In the present description, the term "missense 
mutation" will be intended to also refer to a mutation 
which transforms a codon which represents an amino acid 
into a codon which encodes another amino acid, the 
5 latter, where appropriate, being unable to replace the 
amino acid of origin so as to give a functional 
protein, in the protein at the site of the amino acid 
residue of origin. 

In the present description, the expression 
10 "protein required for the growth of cells" will be 
intended to also refer to a protein which, when it is 
synthesized by cells in a functional manner, allows 
said cells to grow under given culture conditions and 
g which, when it is synthesized by cells in a 

15 nonfunctional manner, requires the introduction of an 
[Vj additional nutrient into said given culture medium in 

D order to allow said cells to grow. Such nonfunctional 

m 

i proteins can, for example, be synthesized by cells 

C- subsequent to conditional mutations such as a 

20 photosensitive- type mutation. 
J5 In order to illustrate with an example, but 

jjf without being limited thereto, mention may be made in 

Fi particular of the thymidylate synthase protein of 

H= E. coli, which has a catalytic site occupied by 

25 cysteine at position 146 of its amino acid sequence, 
and the corresponding mutations of the gene ( thyA) of 
which cause a nutritional requirement for thymine or 
thymidine, no other amino acid being able to replace 
cysteine at this site. 
3 0 In the present description, the term "target 

codon" will be intended to refer to the three- 
nulceot ide-base codon transformed by the missense 
mutation. 

The invention also comprises a method according 
35 to the invention, characterized in that the culture 
medium of step c) does not contain the nutrient 
required by the loss of functionality of said mutated 
protein . 
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According to the invention, step c) for 
culturing said cells can comprise a series of cultures 
of said cells in a culture medium containing the amino 
acid encoded by said target codon, each of said 
5 cultures of the series being prepared as far as 
obtaining the stationary growth phase and followed by 
washing of the cells obtained, the number of cultures 
of the series being sufficient to allow the selection 
of mutations which increase the suppression of said 
10 missense mutation of said mutated gene, and the 
propagation of the allele corresponding to said mutated 
gene . 

The invention also relates to a method 
p according to the invention, characterized in that the 

jD 15 missense mutation is chosen from missense mutations 

Cm 

jjj which spontaneously reverse only at very low frequency, 

O of the order of one organism from at least 10 15 . 

Preferably, the missense mutation will be 
C? chosen from missense mutations which transform a target 

2 0 codon of a gene encoding a protein required for the 
42 growth of said cell, into a codon which, in comparison 

jif with the target codon, exhibits a change of at least 

q two bases, more preferably three bases. 

M= Preference is also given to the methods 

25 according to the invention, characterized in that the 
target codon encodes an amino acid which has a small 
steric volume and/or which is amphiphilic and/or which 
has a steric volume smaller than or substantially equal 
to the steric volume of the amino acid encoded by the 
30 missense mutation. 

Among the target codons, preference is in 
particular given to target codons encoding cysteine and 
missense mutations chosen from missense mutations which 
transform a target codon into a codon encoding valine 
35 or isoleucine. 

The invention also relates to a method 
according to the invention, characterized in that 
step a) for transforming said cells is carried out 
using a vector comprising a sequence of said gene 
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encoding a protein required for the growth of said 
cells, including said missense mutation, in particular 
using a plasmid vector. 

Such vectors will be prepared according to 
5 methods commonly used by those skilled in the art, and 
the clones resulting therefrom can be introduced into 
said cells by usual methods of genetic recombination, 
such as for example lipofection, electroporat ion or 
heat -shock . 

10 In another aspect, a subject of the invention 

is a method for selecting cells capable of producing a 
protein the amino acid sequence of which comprises at 
least one unconventional amino acid, characterized in 
that it comprises steps a) , where appropriate b) , and 

15 c) of a method according to the invention, and 
selecting the cells capable of growing in step c) . 

Preferably, the method for selecting cells 
according to the invention will also comprise a step d) 
for culturing the cells in step c) in a culture medium 

2 0 containing said amino acid encoded by said target 
codon, the concentration of said amino acid possibly 
being at a concentration higher than the concentration 
of said amino acid used in step c) , and choosing the 
cells sensitive to the concentration of said amino acid 

25 used in step d) . 

The expression "cell sensitive to a chemical or 
biochemical compound or to a given concentration of 
said compound" is intended to refer to a cell the 
growth of which is partially or totally inhibited when 

30 it is cultured in a culture medium containing said 
chemical or biochemical compound or said concentration 
of said compound. 

The invention also comprises a method for 
selecting cells according to the invention, 

35 characterized in that the aminoacyl - tRNA synthetase 
which recognizes the amino acid encoded by said 
missense mutation of said selected cells is capable of 
charging onto one of its associated tRNAs an 
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unconventional amino acid or an amino acid other than 
said amino acid encoded by said missense mutation. 

In the present description, the term 
"associated tRNA" is intended to refer to a tRNA which 
5 is recognized by the aminoacyl - tRNA synthetase which 
recognizes an amino acid, and which can transfer said 
amino acid. 

The invention also comprises a method for 
selecting mutant cells according to the invention, 
10 characterized in that the nucleic acid sequence of the 
gene encoding said aminoacyl -tRNA synthetase includes 
at least one mutation compared with the sequence of the 
corresponding wild- type gene, said mutation not having 
O been introduced by a technique of genetic 

L ff 15 recombination. 

jVj According to another aspect, a subject of the 

Q invention is the prokaryotic or eukaryotic cells 

obtained using a method according to the invention. 
■Q Among the cells which can be used for these 

2 0 purposes, mention may of course be made of bacterial 
Jp cells, such as E. coli , but also yeast cells, as well 
^ as animal cells, in particular mammalian cell cultures, 
g such as in particular Chinese hamster ovary (CHO) 
M= cells, and also insect cells. 

25 The invention also relates to the isolated 

prokaryotic or eukaryotic cells capable of producing a 
protein the amino acid sequence of which comprises at 
least one unconventional amino acid, characterized in 
that they comprise an aminoacyl -tRNA synthetase which 

3 0 recognizes a given amino acid and which is capable of 

charging onto one of its associated tRNAs an 
unconventional amino acid or an amino acid other than 
said given amino acid, and in that the nucleic acid 
sequence of the gene encoding said aminoacyl -tRNA 
35 synthetase includes at least one mutation compared with 
the sequence of the corresponding wild- type gene, said 
mutation not having been introduced by a technique of 
genetic recombination . 



n 
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Thus, the invention relates to a method for 
selecting cells, based on the constitution, by the 
cell, of a metabolic pathway required for its growth, 
making it possible to obtain cells capable of producing 
5 a noncanonical acyl-tRNA capable of charging an 
unconventional amino acid. 

Among the cells according to the invention, 
preference is given to bacterial cells characterized in 
that they are chosen from the following cells deposited 
10 at the CNCM (Collection Nationale de Culture de 
Microorganismes [National Collection of Microorganism 
Cultures] , Paris, France) : 







a) 


E. 


coli strain deposited 


at 


the 


CNCM 


under 


the 


o 






No. 


1-2025 on May 25, 1998, 














15 


b) 


E. 


coli strain deposited 


at 


the 


CNCM 


under 


the 








No. 


1-2026 on May 25, 1998, 
















c) 


E. 


coli strain deposited 


at 


the 


CNCM 


under 


the 








No. 


1-2027 on May 25, 1998, 
















d) 


E. 


coli strain deposited 


at 


the 


CNCM 


under 


the 


Q 


20 




No. 


1-2339 on October 26, 1999, 














e) 


E. 


coli strain deposited 


at 


the 


CNCM 


under 


the 








No. 


1-2340 on October 26, 1999, 


and 












f ) 


E . 


coli strain deposited 


at 


the 


CNCM 


under 


the 








No. 


1-2341 on October 26, 1999. 












25 






The E. coli strain K12, deposited at 


. the 


CNCM 



under the No. 1-2025 and identified under the reference 
P5366, is a descendent of the strain MG1655 (wt E . coli 
K12), which includes the following characteristics: 

- deletion at the thyA locus and replacement 
3 0 with an erythromycin resistance gene, 

- carries a plasmid pTZ18 (col El replicon, 
bla + ) with the Cysl46GUA allele of thymidylate 
synthase . 

The E. coli strain K12, deposited at the CNCM 
35 under the No. 1-2026 and identified under the reference 
p8144, is a descendant of the strain MG1655 (wt E. coli 
K12) , which includes the following characteristics: 

- deletion at the thyA locus and replacement 
with an erythromycin resistance gene, 
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- carries a plasmid pTZ18 (col El replicon, 
bla + ) with the Cysl46GUA allele of thymidylate 
synthase . 

The E. coli strain K12 , deposited at the CNCM 
5 under the No. 1-2 027 and identified under the reference 
p8146, is a descendant of the strain MG1655 SWT E. coli 
K12), which includes the following characteristics: 

- deletion at the thyA locus and replacement 
with an erythromycin resistance gene, 

10 - carries a plasmid pTZ18 (col El replicon, 

bla + ) with the Cysl46GUA allele of thymidylate 
synthase . 

The E. coli strain K12, deposited at the CNCM 
Q under the No. 1-2339 and identified under the reference 

rf 15 p5479, is a descendant of the strain MG1655 (wt E. coli 
bj K12), which includes the following characteristics: 

Jjf - deletion at the thyA locus and replacement 

07 with an erythromycin resistance gene, 

S - deletion at the nrdD locus and replacement 

f~i 2 0 with a kanamycin resistance gene, 

=P - carries the R223H allele of the valS gene, 

25 - carries a plasmid pTZ18 (col El replicon, 

□ bla + ) with the Cysl46GUA allele of thymidylate 

^ synthase . 

25 The E . coli strain K12, deposited at the CNCM 

under the No. 1-2340 and identified under the reference 
P5485, is a descendant of the strain MG1655 (wt E. coli 
K12), which includes the following characteristics: 

- deletion at the thyA locus and replacement 
3 0 with an erythromycin resistance gene, 

- deletion at the nrdD locus and replacement 
with a kanamycin resistance gene, 

- carries the Val 276 Ala chromosomal allele of 
the valS gene, 

35 - carries a plasmid pTZ18 (col El replicon, 

bla + ) with the Cysl46GUA allele of thymidylate 
synthase . 

The E. coli strain K12 , deposited at the CNCM 
under the No. 1-2341 and identified under the reference 



P5486, is a descendant of the strain MG1655 (wt E. coli 
K12) , which includes the following characteristics: 

- deletion at the thyA locus and replacement 
with an erythromycin resistance gene, 

- deletion at the nrdD locus and replacement 
with a kanamycin resistance gene, 

- carries the Asp 23 0 Asn chromosomal allele of 
the valS gene, 

- carries a plasmid pTZ18 (col El replicon, 
bla + ) with the Cysl46GUA allele of thymidylate 
synthase . 

The invention also comprises the use of a 
method or of a cell according to the invention for 
producing protein, in particular recombinant protein, 
the amino acid sequence of which comprises at least one 
unconventional amino acid. 

In another aspect, the invention relates to a 
process for producing a protein the amino acid sequence 
of which comprises at least one unconventional amino 
acid, characterized in that it comprises the following 
steps : 

a) where appropriate, selecting a cell by a method 
according to the invention; 

b) culturing said cell selected in step a) or a cell 
according to the invention in a culture medium and 
under culture conditions which allow the growth of 
said cell; and 

c) isolating said protein comprising at least one 
unconventional amino acid from the culture 
supernatant and/or from the cell pellet obtained in 
step b) . 

Among the proteins which can be produced by a 
process according to the invention, mention may be 
made, but without being limited thereto, of proteins 
which, through the incorporation of at least one 
unconventional amino acid, make it possible to obtain a 
desired activity which a protein the sequence of which 
includes only conventional amino acids does not make it 
possible to obtain. The term "activity" is intended to 
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refer, in general, to any activity such as a 
physiological or biological activity, even partial, 
relating to unicellular or multicellular organisms, 
such as for example a structural or biochemical 
5 activity, for example an enzymatic or antigenic 
activity, an activity of antibody type, or an activity 
which modulates, regulates or inhibits biological 
activity, or such that it allows the implementation 
thereof in a process for biosynthesizing or for 
10 biodegrading chemical or biochemical compounds. 

Among the proteins which can be produced by a 
process according to the invention, mention may also be 
made of proteins for which the incorporation of at 
O least one unconventional amino acid is carried out such 

r: 15 that there results therefrom no substantial 
Ly modification of the biological activity of the 

jf corresponding unmodified protein. Besides the conserved 

rh biological activity of the corresponding unmodified 

protein, these proteins according to the invention will 
m 2 0 have an unconventional amino acid with specific 
«P properties which may be advantageously exploited. 

:Sf Among the specific properties conferred by the 

O presence of an unconventional amino acid, mention may 

^ be made in particular of the properties linked to the 

2 5 presence of a functional group on said unconventional 

amino acid, capable of reacting easily and specifically 
with a chemical or biochemical compound under 
conditions which make it possible not to modify the 
activity of the protein or which avoid modifying the 

3 0 conventional amino acids. 

The presence of this specific functional group 
may advantageously be used, for example, for: 

(i) purifying any protein, in particular any 
recombinant protein, which incorporates said 

3 5 unconventional amino acid; 

(ii) coupling such a protein to a solid support; 

(iii) coupling to such a protein molecules capable of 
being detected, such as spectroscopic probes of varied 
nature ; 



(iv) coupling to such a protein lipophilic or hydro- 
philic polymers which allow the solubilization thereof 
in solvents or which allow masking against recognition 
by antibodies; 

(v) coupling such a protein to a polynucleotide; 

(vi) coupling such a protein to a chemical or 
biochemical compound the presence of which makes it 
possible to increase, to decrease, to modify, to 
regulate or to target the biological activity of said 
protein, or to modify the bioavailability thereof as a 
compound for therapeutic use; or 

(vii) permanently attaching to such a protein a 
coenzyme which otherwise would diffuse in solution. 

According to the present invention, the 
incorporation of at least one unconventional amino acid 
may relate to amino acids responsible for specificity 
or for the activity, or responsible for the structural 
conformation, for the charge, for the hydrophobicity or 
for the multimerization capacity of the corresponding 
unmodified protein. Thus, proteins of equivalent, 
increased or decreased activity, or of specificity 
which is equivalent, more restricted or broader than 
the corresponding unmodified protein containing 
conventional amino acids may be created. 

The term "unmodified protein" is intended to 
refer to the wild-type or recombinant protein which 
consists of conventional amino acids, and from which is 
derived the protein comprising the unconventional amino 
acid . 

Preferably, the production process according to 
the invention is characterized in that said culture 
medium of step b) which allows the growth of said cell 
contains said unconventional amino acid or a precursor 
thereof . 

According to one particular embodiment, a 
production process according to the invention is 
characterized in that said unconventional amino acid is 
synthesized by said cell, the synthesis of said 



- 12 - 

unconventional amino acid possibly being increased by 
genetic modification of said cell. 

The invention also relates to a process for 
producing a protein the amino acid sequence of which 
comprises at least one unconventional amino acid 
according to the invention, characterized in that said 
cell is auxotrophic for the amino acid encoded by said 
target codon. 

Also included in the present invention are the 
processes according to the invention, characterized in 
that said cell comprises a homologous or heterologous 
gene of interest the coding sequence of which includes 
at least one target codon. 

In general, the gene of interest will encode a 
messenger RNA which will then be translated into a 
protein of interest. 

The gene of interest can be isolated by any 
conventional technique, such as cloning or PCR 
(Polymerase Chain Reaction) , or chemically synthesized. 
It can be of genomic (possessing one or more introns) 
or complementary DNA (cDNA) type. The protein of 
interest can consist of a mature protein, a precursor, 
and in particular a precursor intended to be secreted 
and comprising a signal peptide, a truncated protein, a 
chimeric protein originating from the fusion of 
sequences of diverse origins, or a mutated protein 
having improved and/or modified biological properties. 

In general, the homologous or heterologous gene 
of interest may be chosen from genes encoding any 
protein which can be used as a therapeutic or cosmetic 
compound, or as a diagnostic reagent or as a compound 
which can be used in a biosynthesis or biodegradat ion 
process . 

By way of examples, mention may be made of 
genes of interest encoding the following proteins of 
interest : 

- cytokines or lymphokines (a-, p- and y-interf erons, 
interleukins, and in particular IL-2, IL-6, IL-10 or 
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IL-12, tumor necrosis factors (TNFs) , colony 
stimulating factors (GM-CSF, C-CSF, M-CSF, etc.)); 

- cell or nuclear receptors, in particular those 
recognized by pathogenic organisms (viruses, bacteria 
or parasites), or the ligands thereof; 

- proteins involved in a genetic disease (factor VII, 
factor VIII, factor IX, dystrophin or minidystrophin, 
insulin, CFTR (Cystic Fibrosis Transmembrane 
Conductance Regulator) protein, growth hormones 
(hGH) ) ; 

- enzymes (urease, renin, thrombin, etc.) or any 
enzymes involved in the metabolism or biosynthesis of 
proteins, of lipids, of nucleic acids, of sugars, of 
amino acids, of fatty acids or of nucleotides ; 

- enzyme inhibitors (a- 1-antitrypsin, antithrombin III, 
viral protease inhibitors, etc.); 

- compounds with an antitumor effect, capable of 
inhibiting at least partially the initiation or 
progression of tumors or cancers (antibodies, 
inhibitors acting on cell division or on transduction 
signals, tumors suppressor gene expression products, 
for example p53 or Rb, proteins which stimulate the 
immune system, etc . ) ; 

- class I or II major histocompatibility complex 
proteins, or regulatory proteins which act on the 
expression of the corresponding genes 

- proteins capable of inhibiting a viral, bacterial or 
parasitic infection or the development thereof 
(antigenic proteins having immunogenic properties , 
antigenic epitopes, antibodies, etc.); 

- toxins such as ricin, cholera, diphtheria, etc. 
toxins , or immunotoxins ; 

- markers (P-galacatosidase , peroxidase, etc.); and 

- lucif erase, GFP (green fluorescent protein), etc. 

The invention also comprises a process for 
producing a protein according to the invention, 
characterized in that the culture medium of step b) 
also comprises the compounds required for inducing the 
synthesis of the protein encoded by said gene of 
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interest. These compounds are known to those skilled in 
the art and depend, in particular, on the cell and on 
the homologous or heterologous gene selected. 

The invention also relates to a process 
according to the invention, characterized in that the 
biological activity of the protein encoded by said gene 
of interest is at least partially conserved after the 
incorporation of said unconventional amino acid at the 
level of the target codon of said gene of interest. 

The invention also relates to a process 
according to the invention, characterized in that the 
unconventional amino acid is chosen from unconventional 
amino acids of formula I and of configuration L 

COOH 

I 

RrOH 
H 

/ 
N 

\ 

R 2 (I) 

in which: 

Ri or R 2 represents radicals containing a functional 
group capable of reacting selectively, preferably 
chosen from aldehyde, ketone, ethenyl, ethynyl and 
nitrile groups . 

Among these groups, preference is given 
particularly to the oxo (aldehyde or ketone) group with 
selective reactivity which would facilitate the 
chemical f unctionalization of the proteins. Other 
simple groups , such as the ethynyl group, would also 
lend themselves to selective reactions. A vast 
collection of experiments conducted with the aid of 
systems of acellular (ex vivo) translation and of 
acyl-tRNAs synthesized in vitro has demonstrated that a 
large variety of acyl groups can be transferred on the 
ribosome in response to a codon read by the tRNA. 
Briefly, lateral modifications of amino acids all 
appear to be compatible with translation (to date, no 
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amino acid having a side chain which would be bulky- 
enough to block translation has been found) ; 
substitutions of the amino motif to an alkylamino, to a 
hydroxyl and to a hydrazino motif are compatible with 
5 ribosome-catalyzed transpeptidat ion chemistry 

(Bain et al . 1991) (it is known that the ribosome can 
form polyesters in addition to conventional 
polyamides) ; the substitution of the alpha hydrogen of 
the H 2 NCH (R) -COOH motif with an alkyl (methyl) group, 
10 or the inversion of configuration at the alpha carbon 
(D-amino acids) is not, on the other hand, accepted by 
the ribosome . 

A subject of the invention is also a process 
according to the invention, for protein 

15 f unctionalization . 

The invention also relates to a protein 
purification process, characterized in that it 
comprises the following steps: 

a) incorporating into the amino acid sequence of said 

2 0 protein an unconventional amino acid containing a 

functional group capable of reacting selectively, 
using a process according to the invention; 

b) bringing the solution containing the protein 
obtained in step a) into contact with a support 

25 comprising a compound capable of reacting 

specifically with said functional group and of 
attaching specifically said protein; and 

c) isolating said protein attached to the support. 

The processes for purifying protein, which may 

3 0 be natural or recombinant, conventionally used by those 

skilled in the art generally employ methods used 
individually or in combination, such as fractionation, 
chromatography methods, immunoaf f inity techniques using 
specific mono- or polyclonal antibodies, etc. These 
35 methods are sometimes long and tedious and do not 
always make it possible to obtain the specific 
activity, the level and the yield of purification 
desired. The presence of a specific functional group on 
the protein to be purified, which group is capable of 
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reacting selectively with the purification support 
without modifying the activity of the protein, would 
greatly facilitate the purification of protein required 
for the use thereof . 
5 The invention also relates to a process for 

attaching a protein to a chemical or biochemical 
compound, characterized in that it comprises the 
following steps: 

a) incorporating into the amino acid sequence of said 
10 protein, by a process according to the invention, 

an unconventional amino acid containing a 
functional group capable of reacting selectively; 

b) bringing the protein obtained in step a) into 
contact with said chemical or biochemical compound 

15 comprising a group capable of reacting 

specifically with said functional group in a 
medium allowing the reaction. 

Preferably, the attachment of a protein to a 
chemical or biochemical compound is an attachment by 
2 0 covalent bonding. 

The chemical or biochemical compounds which can 
be used in said attachment process according to the 
invention may be chosen from all compounds capable of 
reacting with the functional group of the incorporated 

2 5 unconventional amino acid. 

In the present description, the term "protein 
complex" is intended to refer to the product obtained 
in step b) of the process described above, comprising a 
protein according to the invention attached to a 

3 0 chemical or biochemical compound. 

The invention also comprises a process 
according to the invention, characterized in that said 
chemical or biochemical compound is, itself, attached 
to a solid support or is a constituent compound of a 
3 5 solid support. 

The invention also relates to a process 
according to the invention, for preparing a protein 
complex. 
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Preferably, the invention comprises the 
processes of the invention, characterized in that said 
attached protein or said chemical or biochemical 
compound is chosen from therapeutic, cosmetic or 
5 diagnostic compounds. 

Said attached protein will be chosen, in 
particular, from proteins the amino acid sequence of 
which comprises an unconventional amino acid according 
to a process of the invention, and for which the 
10 corresponding wild-type or recombinant unmodified 
protein is chosen from proteins which can be used as 
therapeutic or cosmetic compounds or as diagnostic 
reagents . 

q Preferably, the processes according to the 

J3 15 invention are characterized in that the chemical or 
biochemical compound is chosen from compounds capable 
O of modifying the biological activity of the attached 

~ s protein. 

yy The expression "compounds capable of modifying 

2 0 the biological activity of another compound" is 
jg intended to refer to a compound capable of increasing, 

W of decreasing or of regulating the biological activity 

C3 

of said other compound. 
M= The invention also comprises a process 

25 according to the invention, characterized in that the 
chemical or biochemical compound is chosen from 
compounds the biological activity of which can be 
modified by the attached protein. 

The invention also comprises a process 
30 according to the invention, characterized in that the 
chemical or biochemical compound is chosen from 
compounds comprising a protein, a polynucleotide, a 
fatty acid, a sugar or a natural or synthetic polymer. 

According to another aspect, a subject of the 
35 invention is the proteins, in particular the 
recombinant proteins, and the protein complexes 
obtained by a process according to the invention. 

The invention also comprises a method for 
selecting compounds capable of binding to a protein 
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according to the invention or capable of binding to the 
chemical or biochemical compound of the protein complex 
according to the invention. Among these methods, 
mention may be made, as an example, of a method 
5 characterized in that it comprises the following steps: 

a) bringing said compound which may be selected 
into contact with the protein or the protein complex 
according to the invention, said protein or protein 
complex possibly being in particular attached to a 

10 solid support; 

b) determining the capacity of said compound to 
bind with the protein or the protein complex according 
to the invention. 

The compounds which may be selected can be 

15 organic compounds, such as proteins or carbohydrates, 
or any other already known organic or inorganic 
compounds, or novel organic compounds developed using 
molecular modeling techniques and obtained by chemical 
or biochemical synthesis, these techniques being known 

20 to those skilled in the art. 

The cells according to the invention may also 
advantageously serve as a model, and be used in 
processes for studying, identifying and/or selecting 
proteins according to the invention or compounds which 

25 may possess a desired activity. 

The invention also relates to the use of a 
protein or of a protein complex according to the 
invention as a diagnostic reagent, and also to the 
diagnostic processes, in particular for specifically 

30 detecting, identifying, locating and/or assaying 
polypeptides or polynucleotides, using a protein or a 
protein complex according to the invention. 

Proteins having incorporated at least one 
unconventional amino acid and having partially or 

35 totally conserved the initial activity of the 
corresponding unmodified wild-type or recombinant 
proteins, such as antibodies, antigens, enzymes or the 
biologically active fragments thereof known to be used 
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in diagnostic processes are, specifically, included in 
the proteins according to the invention. 

Similarly, protein complexes formed from a 
protein according to the invention and a chemical or 
5 biochemical compound, such as complexes comprising an 
antibody, an antigen or an oligonucleotide probe 
coupled to an enzyme, to a substrate or to a molecule 
capable of being detected, are included in the protein 
complexes according to the invention. 
10 Among the diagnostic processes according to the 

invention, mention may be made, for example , of the 
processes comprising the following steps: 

a) bringing the biological sample which may 
s «% contain the compound sought into contact with a protein 

%Q 15 or a protein complex according to the invention, said 
r% protein or protein complex possibly being in particular 

Q attached to a solid support; and 

M; b) revealing, identifying, locating and/or 

assaying the complex formed between the compound sought 

2 0 and a protein or a protein complex according to the 

invention. 

UJ Those skilled in the art will be able to adapt 

Si the known standard diagnostic processes to the proteins 

j=s or the protein complexes according to the invention. 

25 The specific techniques and reagents which 

allow the complex formed to be revealed, identified, 
located and/or assayed, and which may be used in the 
processes of the invention, are also well known to 
those skilled in the art and are, for example, the 

3 0 ELISA, RIA, immunofluorescence or PCR techniques, or 

other techniques for amplifying a target nucleic acid 
which are known to those skilled in the art . 

The invention also relates to a diagnostic kit 
or pack, in particular for specifically detecting, 
35 identifying, locating and/or assaying proteins or 
polynucleotides, characterized in that it contains a 
protein or a protein complex according to the 
invention. 
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The invention also relates to the use of a 
protein, of a protein complex or of a cell according to 
the invention, for preparing a pharmaceutical or 
cosmetic composition . 
5 Finally, a subject of the invention is a 

pharmaceutical or cosmetic composition comprising a 
protein, a protein complex or a cell according to the 
invention . 

Other characteristics and advantages of the 
10 invention appear in the remainder of the description 
with the examples hereinafter. 



EXAMPLES 

EXAMPLE 1: Construction of an E. coli strain 
15 including a Cys— »Val missense mutation at the 
thymidylate synthase active site and creating a 
nutritional requirement for thymine, thymidine or 
cysteine. 

The artificial alleles of the thyA gene are 
20 constructed by site-directed mutagenesis of the plasmid 
pTSO (Lemeignan et al . 1993), which derives from the 
plasmid pTZ18R (BioRad) by insertion of the wild-type 
thyA gene of E. coli. The site-directed mutagenesis 
using an oligonucleotide is carried out according to 
25 the method described by Kunkel et al . (1987) on the 
phagemid pTSO . The preparation of the single-stranded 
matrix of pTSO, amplified in the strain R21032 (Kunkel 
et al., 1987) (Hfr KL16 P045 [lysA (61-62) ] dutl ungl 
thil relAl supE44 zbd-279 : : TnlO) is carried out 
3 0 according to the protocol described by Sambrook et al . 
(1989). A 5 ' -phosphorylated oligonucleotide (purchased 
from the company Genome Express) is used as a mutagenic 
primer : 

OligodeoVynucleotide 1 : 

5 ' pTGGAT^AATGGCGCTGGCACCGGTACATGCATTCTTCCAGTTCTATGT 

The hybridization of this oligonucleotide with 
the single-stranded matrix in each of the two 
constructions is carried out with 10 ng of 
oligonucleotide and 0 . 2 \xl of matrix in a volume of 
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10 |il of a buffer solution containing 20 mM of Tris- 
HC1, pH 7.5, 2 mM of EDTA and 50 mM of sodium chloride. 
The tubes are incubated for 5 min at 7 0 °C, and then 
gradually cooled to 3 0°C. 0.5 mM of each of the dNTPs, 
5 1 mM of ATP, 10 mM of Tris-HCl at pH 7.5, 10 mM of 
magnesium chloride, 2 mM of dithiothreitol and 1 unit 
of each of both the T4 phage DNA ligase and DNA 
polymerase enzymes are then added to this mixture. This 
reaction mixture, with a final volume of 20 |ul , is 
10 incubated for 60 min at 37°C, and then 5 |il of this are 
then used to transform competent cells of the E. coli 
K12 strain GT869 (Parsot, C. 1986) (thrB1004 pro thi 
strA hsdS lacZ AM15 [F' lacZ AM15 laclq traD36 proA+ 
proB+] ) according to the method described by 
y3 15 Sambrook et al . (1989). The transformed cells are 
r\ plated out onto Petri dishes containing LB medium 

Q supplemented with 100 mg/1 of carbenicillin . Twelve 

j£j clones resistant to the antibiotic are reisolated on 

y[| the same medium. The single -stranded DNA corresponding 

^ 2 0 to the phagemids of these clones is prepared and 
J= sequenced according to the dideoxy method (Sanger et 

W al . , 1977). The M13 sequencing kit (Boehringer 

S Mannheim, Mannheim, Germany) and the deoxyadenosine 

5 '- (a- thio) triphosphate (1300 Ci/mmol, Amersham) are 
25 combined according to the suppliers' indications. Four 
primers are used to determine the sequence of the thyA 
alleles : 



Oligodeoxynucleonide 3 



C^^li9°deoxynucleot jkle 4 
3 0 Dligodeoxynucleoticke 5 
Oligodeoxynucleotide\6 



5 ' GGTGTGATCATGATGGTC 



5 ' CCTGCAAGATGGATTCCC 
5 ' CGCGCCGCATTATTGTTTC 
5 ' GTCTGGACCGGTGGCGACA 
The plasmid pTSl thus obtained propagates the 
thyA:Vall46 allele, in which position 146 occupied in 
the wild-type thyA gene by the UGC codon of cysteine is 
35 occupied by the GUA codon of valine. The plasmid pTSl 
is introduced by transformation, carried out according 
to the method of Sambrook et al . (1989), into the E. 
coli K12 AthyA strain, 01308 (Lemeignan et al . , 1993), 
in which the chromosomal thymidylate synthase gene, 
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thyA, is deleted. The transformed strain carrying the 
thyA:Vall46 plasmid allele, P5366, is shown to be 
incapable of growing without thymine or thymidine being 
added to the culture medium, as is the strain P13 08 
5 from which it derives . On the other hand, the strain 
p5366 shows marginal growth at 30 °C on a cysteine 
diffusion gradient prepared in Petri dishes containing 
25 ml of glucose MS mineral medium, starting from a 
central well containing 0 . 1 ml of a 400 mM L-cysteine 
10 solution. Under the same conditions, the strain P1308 
gives rise to no detectable growth. Thus, the missense 
mutation which converts the catalytic cysteine at 
position 146 of thymidylate synthase to valine can be 
^ partially suppressed by supplying very large amounts of 

*S 15 exogenous cysteine. The addition of 0.1 mM of valine to 
rl the medium of the Petri dishes abolishes the growth of 

O the strain P5366 on a cysteine gradient. Thus, it is as 

though the cysteine can infiltrate into the valyl-tRNA 
yg synthetase active site to form erroneous Cys-tRNAs Val 

20 capable of correcting the substitution of the cysteine 

S E 

jg in the thymidylate synthase active site, with valine. 

UJ The excess of valine would prevent the formation of 

™ these erroneous Cys- tRNAs Val . 

25 Example 2: Construction of an E. coli strain 

including a Cys— >Ile missense mutation at the 
thymidylate synthase active site and creating a 
nutritional requirement for thymine, thymidine or 
cysteine . 

3 0 The corresponding construction is also carried 

out in order to replace the cysteine at position 146 of 
thymidylate synthase, by site-directed mutagenesis 
using oligonucleotide 2, according to the same protocol 
as in Example 1. 
35^ OligodeoxVnucleotide 2: 

5 ' pTGGATAAAATGGCGCTGGCACCGATACATGCATTCTTCCAGTTCTATGT 

The plasmid pTS2 thus obtained propagates the 
thyA:Ilel46 allele in which position 146 occupied in 
the wild-type thyA gene by the UGC codon of cysteine is 



3 5\ 
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occupied by the AUA codon of isoleucine. The strain 
which propagates the thyA:Ilel46 plasmid allele, £5274, 
is shown to require the nutritional supply of thymine, 
of thymidine or of cysteine in excess, as was the 
5 strain 05366. The phenotypic suppression of the strain 
p5274 by the cysteine is abolished with 0.1 mM of 
isoleucine, as was that of the strain 05366 with 
valine. Thus, it is as though the isoleucyl - tRNA 
synthetase is capable of forming erroneous Cys - tRNAs Ile 
10 in the presence of an excess of cysteine, and that this 
erroneous formation is prevented by the presence of an 
excess of isoleucine. 

Example 3: Selection of genetic code mutants 

15 which misincorporate cysteine instead of valine, by 
serial culturing in liquid, and genetic 

characterization of the valyl-tRNA synthetase mutants 
thus obtained . 

The strain (35366 carrying the thyA:Vall46 

20 missense allele on the plasmid pTSl is cultured in 
glucose MS mineral medium (2 g/1, Richaud et al . , 1993) 
supplemented with 0.3 mM of thymidine, for 24 h at 3 0°C 
in aerobiosis. The cells are then washed twice with 
deoxygenated MS mineral medium. A deoxygenated nutrient 

2 5 medium containing 10 ml of glucose MS mineral medium 
supplemented with 1.5 mM of cysteine is inoculated at 
1/100 using the washed cells. The cells are then 
cultured in anaerobiosis for 24 h at 30°C, and a fresh 
tube containing 10 ml of deoxygenated cysteine glucose 

30 MS mineral medium is inoculated with a 1/100 dilution 
of the previous culture in the stationary phase. This 
procedure is repeated 26 times. At the end of this 
serial propagation, 12 clones from the liquid culture 
are isolated on plates containing thymidine (0.3 mM) 

35 glucose (2 g/1) MS mineral medium, in aerobiosis, and 
stored in suspension in the same liquid medium at 
-80°C. The twelve clones are tested on plates 
containing glucose MS mineral medium supplemented with 
nutrient factors. All these clones are shown to require 
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thymine or thymidine as a growth factor, unless 
cysteine is present in the culture medium, at a 
concentration of at least 1.5 mM. 

Two such clones are chosen for a thorough 
5 genetic characterization, £8144 and 08146. Experiments 
involving transduction, using the PI phage, of the 
kanamycin resistance characteristic, introduced into 
the nrdD locus neighboring the valS gene of valyl-tRNA 
synthetase (97 min of the E. coli K12 chromosome) , are 
10 carried out using the strains p8144 and (38146. In both 
cases, approximately half the kanamycin-resistant 
transductants also show nutrient dependency for 
thymidine, which can be suppressed with exogenous 
*«§ cysteine at the concentration of at least 1.5 mM. This 

C 15 proportion is in agreement with the genetic distance 
TTi between the valS and nrdD genes (0.4 min) and implies 

Q that the phenotype of suppression of the Cys— »Val 

%: missense mutation at the thymidylate synthase active 

ifcO * ' Cit-Q with low concentrations of cysteine is caused by 

!L 20 the genetic modification of the valS locus. 
jE The establishment of a genetic modification in 

y~j the valS gene of the modified strains is confirmed by 

JSj sequencing this locus : an A changed to a C causes the 

H replacement of the lysine at position 277 with 

25 glutamine in the two modified strains P8144 and 08146. 
The sequencing is performed on a matrix obtained via a 
polymerase chain reaction (PCR) carried out under 
conditions described by Sambrook et al . (1989). The 
amplification reaction is carried out in 100 |ul of a 
30 solution containing 10 ng of genomic DNA of the strain 
P8144 or p8146, 20 pmol of each primer, 40 nmol of an 
equimolar mixture of the 4 deoxynucleotide 
triphosphates, and 10 |ul of a buffer composed of 100 mM 
Tris-HCl pH 8.3, 500 mM KC1 and 2 0 mM MgCl 2 , in the 
35 presence of 1 to 2 units of Vent polymerase (Biolabs) . 
For each reaction, 3 0 polymerization cycles are 
performed, using a DNA amplifier ( Perkin-Elmer Cetus) , 
as follows: the denaturation is carried out at 94 °C for 
5 min for the 1st cycle and 1 min for the subsequent 



cycles, the hybridization at 58 °C for 1 min and the 
elongation at 72 °C for 3 min for the first 2 9 cycles 
and for 10 min for the last cycle. Oligonucleotudes 7 
and 8 are used for amplifying the gene. 
Oligodeoxwiucleotide 7 : 
5 ' GGGGAATTQGGTGTGTGAAATTGCCGCAGAACG 
'oligodeoxyniicleotide 8 : 
5 ' GGCAAGCTTCCAGTATTTCACGGGGAGTTATGC 

The PCR fragments thus obtained are purified 
using the QIAquick kit (Qiagen) and sent to the company 
Genaxis in order to determine the sequence thereof. 

Example 4: Phenotypic suppression by metabolic 
precursors of cysteine. 

The nutrient requirement for cysteine of the 
modified strains 08144 and 08146 is exploited in order 
to characterize metabolic precursors which may 
substitute for cysteine in the culture medium without 
giving rise to degradation by oxidation. S-carbamyl -L- 
cysteine (3 mM) , S -methyl -L-cysteine (3 mM) and 
L- thiazolidine-4 -carboxylic acid (2 mM) prove to be 
capable of replacing cysteine as a growth factor for 
the modified strains 08144 and 08146, instead of 
thymidine or thymine. The same compounds prove to be 
capable of satisfying the cysteine requirement of a 
cysN::kan mutant (strain JT1 , provided by M. Berlyn, 
Coli Genetic Stock Center, Yale University, USA (Levh 
et al . , 1988)). However, the addition of none of these 
substances allows growth of the strain 01308 carrying a 
chromosomal deletion of the thyA gene of thymidylate 
synthase, thus excluding their contamination with 
traces of thymine or of thymidine. 

Example 5: Selection of genetic code mutants 
which misincorporate cysteine instead of valine, by 
isolation on solid medium, and genetic characterization 
of the valyl-tRNA synthetase mutants which 
misincorporate cysteine . 
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The strain 05366 carrying the thyA:Vall46 
missense allele on the plasmid pTSl is transduced with 
a PI phage lysate harvested on an E. coli auxiliary 
strain (07170, Bouzon et al . , 1997) into the chromosome 
5 of which a kanamycin resistance marker had been 
introduced at the nrdD locus, neighboring the valS 
locus of the valyl-tRNA synthetase gene, thus producing 
the strain 05419. A mutator allele of the dnaQ gene is 
introduced extemporaneously by transduction of the 
10 strain 05419 using a PI phage lysate harvested on an 
auxiliary strain (MS2131, Shapiro 1990) carrying a 
tetracycline resistance marker dnaQ : :miniTnlO . Such a 
tetracycline-resistant clone showing a spontaneous 
„ mutation rate amplified approximately 1000-fold (for 

%Q 15 the acquisition of streptomycin resistance) is cultured 
at 30°C in glucose minimum medium in the presence of 
O thymidine (0.3 mM) . After 24 h, the cells are harvested 

and washed twice in an identical volume of culture 
yg medium without thymidine. A 0,1 ml volume of the 

s 2 0 resulting suspension, corresponding to approximately 
10 8 bacteria, is plated out on the surface of a series 
W of Petri dishes containing a concentration of S- 

=: carbamyl - L - cysteine ranging between 0 and 8 mM by 1 mM 

y, increment, adding the glucose (2 g/1) MS mineral 

2 5 medium. The same procedure is applied to the non- 
mutator strain 05419, to the wild- type dnaQ gene. All 
the Petri dishes are incubated for 96 h at 30°C. 
Colonies appear on the dishes having a concentration of 
S-carbamyl -L-cysteine which exceeds 2 mM, only when the 
30 dnaQ : : miniTnlO mutator allele has been introduced into 
the strain tested. 

A PI phage lysate harvested from such a clone 
is used to transduce the strain 053 66 carrying the 
thyA:Vall46 plasmid allele. Approximately half the 
35 kanamycin-resistant transductants are shown to be 
capable of growing in the presence of 3 mM of S- 
carbamyl -L-cysteine and in the absence of thymine or of 
thymidine, among which transductants, the strain 05455. 
The other half of the transductants is incapable of 



this and requires thymine or thymidine in order to 
proliferate, as does the strain p5366. This proportion 
between the phenotypes is in agreement with the genetic 
distance between the valS and nrdD loci (0.4 min) . 
Thus, the suppression of the Cys— »Val thyA missense 
mutation with a low concentration of exogenous cysteine 
might result from modification of the valyl-tRNA 
synthetase gene. The valS locus of one of the strains 
obtained by transduction of P5366 and capable of 
growing in the presence of 3 mM of S-carbamyl-L- 
cysteine and in the absence of thymine or of thymidine, 
referred to as P5455, is amplified by polymerized chain 
reaction and sequenced as described in Example 3. An A 
changed to a C causes the replacement of the threonine 
at position 222 with proline, thus confirming the 
establishment of a genetic modification in the valS 
gene of the strain P5455. 

Example 6: Sensitivity of the valyl-tRNA 
synthetase mutants to noncanonical amino acids. 

The strains p5455, p8144 and p8146 are tested 
for their sensitivity to artificial amino acids which 
show steric resemblance to valine. The test is carried 
out on dishes containing glucose of MS mineral medium 
supplemented with thymidine. The cells are cultured in 
aerobic medium (glucose MS mineral medium, 0.3 mM 
thymidine) for 24 h at 30°C and diluted to 1/250 in MS 
mineral medium. 0 . 5 ml of this cell suspension is 
plated out onto Petri dishes containing 25 ml of 
glucose MS mineral medium. A well is then hollowed out 
at the center of the dish and filled with 0 . 1 ml of an 
amino acid solution: 

(1) 100 mM L-2-aminobutyrate 

(2) 100 mM L-2-aminovalerate 

(3) 100 mM L-2 , 3 -diaminopropionate 

(4) 50 mM L-3-thiol-2-aminobutyrate . 

The dishes are then incubated for 24 h at 3 0°C 
and the possible appearance on the dishes of a zone of 
inhibition around the well is recorded. The diameters 
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of the limited zones of growth inhibition on Petri 
dishes are measured: 

L-2-aminobutyrate : 5.2 cm (p5455) , 5.7 cm (P8144), 
6.7 cm (p8146) ; 

5 L-2-aminovalerate : 2.1 cm (P5455) , 1.5 cm (P8144), 
6.7 cm (p8146) ; 

L-2 , 3 -diaminopropionate : 2.3 cm (P5455) , 2.7 cm 
(p8144) , 1. 9 cm (p8146) ; 

L-3-thiol-2-aminobutyrate : 2.0 cm (P5366) , 4.6 cm 
10 (p5455) , 4.0 cm (08144), 4.0 cm (P8146) . 

L-2-aminobutyrate, L-2 -aminovalerate and L-2,3- 
diaminopropionate , at the concentrations indicated, 
have no effect on the strain P5366 containing the wild- 
», tyP e valS allele, but inhibit the growth of the strains 

yg 15 carrying a mutated valS gene. L-3-thiol-2-aminobutyrate 
p=J inhibits the growth of all the strains, but a more 

p significant inhibition may be noted on the mutated 

strains. Thus, it is as though the valyl-tRNA 
synthetase mutants have a broadened specificity making 

a 2 0 them capable of charging tRNAs Val with amino acids which 

Q 

~^ cannot be incorporated by the wild-type form of the 

ui enzyme . 

lT Example 7: Incorporation of the noncanonical 

25 amino acid ae-aminobutyrate into the proteins of an E. 
coli strain mutated in valyl-tRNA synthetase. 

A PI phage lysate obtained from the strain 
P5455 (see Example 5) , was used to transduce the strain 
CU505 carrying an £ilvCABD deletion and a leu mutation 
3 0 making it auxotrophic for valine, isoleucine and 
leucine. The strain CU505 was obtained from the Coli 
Genetic Stock Center, at Yale University (USA) . 
Transductant clones were selected on kanamycin LB 
plates and tested for their sensitivity to 
35 aminobutyrate (3 mM) in solid glucose (2 g/1) MS medium 
containing 0.3 mM of each of the three amino acids 
valine, isoleucine and leucine. Approximately 50% of 
the transductant clones could not grow under these 
conditions, indicating the cotransduct ion of the 
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valS:T222P allele and the resistance marker nrdD::kan 
(see Example 5) . One of the transductant clones, 
referred to as 05498, was used to demonstrate the 
incorporation of aminobutyrate as a replacement for 
5 valine, in comparison with CU505. The two strains were 
cultured at 30°C in glucose (2 g/1) MS liquid medium 
containing the Ile-Leu dipeptide at the concentration 
of 0.3 mM and the Ile-Val dipeptide at the 
concentration of 0.02 mM, either in the presence of 
10 0.2 mM of L- aminobutyrate or in the absence of the 
analog. The inoculum corresponding to each strain 
originated from a preculture in glucose (2 gl) MS 
liquid medium containing the Ile-Leu dipeptide at the 
concentration of 0 . 3 mM and the Ile-Val dipeptide at 
J3 15 the concentration of 0.04 mM. The cultures (50 ml) in 
Y!\ stationary phase after 24 h at 30 °C were harvested by 

p centrif ugation . For each test, the pellet was then 

resuspended in 25 ml of a solution of trichloroacetic 
gg acid at 100 g/1 (10% TCA) at 4°C, centrifuged, 

s 20 resuspended in 5 ml of 10% TCA, centrifuged again, the 
]p pellet resuspended in 5% TCA, the suspension incubated 

UJ at 95°C for 30 min and centrifuged, the pellet 

52 resuspended in 5 ml of acetone and centrifuged, the 

M= pellet resuspended in 5 ml of acetone and centrifuged, 

2 5 the pellet resuspended in 5 ml of acetone and 

centrifuged, and the pellet left to dry. The residue 
thus obtained was dissolved in 1 ml of a solution of 
NH4HCO3 at 5 0 mM in order to be lyophilized. The 
lyophilizate was dissolved in 2 ml of 6N hydrochloric 

3 0 acid containing 2 g/1 of phenol, and the mixture was 

sealed in a vial and then incubated at 110°C for 20 h. 
The amino acid concentration of the hydrolysate was 
then quantified by derivatization with ninhydrin 
according to the instructions recommended by the 
35 supplier of the Beckman 6300 analyzer. The 
aminobutyrate was detected in the protein hydrolysate 
only when the aminobutyrate had been added to the 
culture medium, and only for the strain P5498. The 
aminobutyrate proportion replaced a quarter of the 
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amount of valine, corresponding to approximately 5 
aminobutyrate residues for 100 amino acids of the total 
proteins. The detailed results of the analyses for the 
two strains CU505 and P54 98, under the two culture 
5 conditions, are given in the table below. 



Chemical composition of the proteins extracted from 
10 strains auxotrophic for valine and cultured with valine 
limiting, with or without aminobutyrate 



CP 



Amino acid 


- CU505 


P5498 


CU5 05 


P5498 


incorporated 


wt valS 


valS T222P 


wtvalS 


valS T222P 


into the 


-Abu 


-Abu 


+Abu 


+ Abu 


proteins 










Abu 


0 


0 


0 


0 .20 


Val 


0 . 83 


0 . 79 


0 . 83 


0 . 61 


Val + Abu 


0 . 83 


0 . 79 


0. 83 


0 . 81 


Ala 


1 . 32 


1 .28 


1 . 32 


1 . 22 


He 


0 . 61 


0 . 61 


0 . 61 


0 . 61 



U 15 



20 



25 



30 



Results expressed in Leu equivalents 

Example 8: Selection of novel genetic code 
mutants using a mutator strain, for isolation on solid 
medium . 

The strain P5419, expressing the thyA:Vall46 
inactive allele on a plasmid and carrying the marker 
£nrdD::kan in the chromosome, as reported in the 
construction thereof described in Example 5, was 
transduced using a PI phage lysate harvested on the 
strain TAD, carrying a mutator marker £mutS::spc, 
conferring spect inomycin resistance, selecting on LB 
solid medium containing spectinomycin (25 mg/1) so as 
to obtain the strain P5555. The mutator phenotype of 
this strain was demonstrated by counting the frequency 
of rif amycin-resistant mutants. Following the 
experimental procedure described in Example 5, clones 
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capable of growing at 30 °C in glucose mineral medium 
without thymidine, in the presence of 2 to 5 mM of S- 
carbamoyl-L- cysteine (SCC) , were obtained. Three of 
these clones were used to prepare PI phage lysates 
5 which were used to transduce the strain p53 66, 
selecting for kanamycin resistance, according to the 
procedure of Example 5. For each of the three lysates, 
approximately half the transductants were capable of 
growing in solid glucose mineral medium containing 3 mM 
10 of SCC, indicating the proximity of a mutation 
suppressing the thyA:C146V missense allele and of the 
marker nrdD::kan. The valS locus of the three strains 
p5479, (35485 and £5486, each corresponding to an SCC- 
n suppressible transductant obtained from one of the 

15 three lysates, was amplified by PCR and sequenced as 
Yi described in Example 3. A different point mutation was 

□ found for each of the three strains, namely Arg 223 

jL 1 changed to His in the strain [35479, Val 276 changed to 

yg Ala in the strain P5485 and Asp 230 changed to Asn in 

s_ 20 the strain P5486. Thus, each clone exhibiting a 

Q 

^ phenotype of suppression of the Cys 146 Val missense 

bJ mutant of thyA also exhibits the aminobutyrate 

sensitivity. Each one of these clones was shown to 

carry a different point mutation in the valS gene, 
25 validating the selective screen as a means of 

diversifying the activity of valyl-tRNA synthetase in 

Escherichia coli. 
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